Segmenting Hashtags using Automatically Created Training Data
نویسندگان
چکیده
1. Hashtags increasingly used to convey the actual message in tweets. Phrases and sentences turned into a hashtag. 2. Word with sentiment may trap inside a multi-word hashtag 3. Noisy and compact nature of language leads to hashtags very difficult to segment; sometimes depends on context. eg. #together; “to get her” or “together”? 4. Can we use carefully auto-segmented hashtags for training? RELATED WORK
منابع مشابه
Segmenting Twitter Hashtags
Social Media Posts On Platforms Such As Twitter Or Instagram Use Hashtags, Which Are Author-Created Labels Representing Topics Or Themes, Toassist In Categorization Of Posts And Searches For Posts Of Interest. The Structural Analysis Of Hashtags Is Necessary As Precursor To Understandingtheir Meanings. This Paper Describes Our Work On Segmenting Nondelimited Strings Of Hashtag-Type English Text...
متن کاملBootstrapped Learning of Emotion Hashtags #hashtags4you
We present a bootstrapping algorithm to automatically learn hashtags that convey emotion. Using the bootstrapping framework, we learn lists of emotion hashtags from unlabeled tweets. Our approach starts with a small number of seed hashtags for each emotion, which we use to automatically label tweets as initial training data. We then train emotion classifiers and use them to identify and score c...
متن کاملSegmenting Hashtags and Analyzing Their Grammatical Structure
Originated as a label to mark specific tweets, hashtags are increasingly used to convey messages that people like to see in the trending hashtags list. Complex noun phrases and even sentences can be turned into a hashtag. Breaking hashtags into their words is a challenging task due to the irregular and compact nature of the language used in Twitter. In this study, we investigate feature-based m...
متن کاملTowards Deep Semantic Analysis of Hashtags
Hashtags are semantico-syntactic constructs used across various social networking and microblogging platforms to enable users to start a topic specific discussion or classify a post into a desired category. Segmenting and linking the entities present within the hashtags could therefore help in better understanding and extraction of information shared across the social media. However, due to lac...
متن کاملHarnessing Twitter "Big Data" for Automatic Emotion Identification
User generated content on Twitter (produced at an enormous rate of 340 million tweets per day) provides a rich source for gleaning people’s emotions, which is necessary for deeper understanding of people’s behaviors and actions. Extant studies on emotion identification lack comprehensive coverage of “emotional situations” because they use relatively small training datasets. To overcome this bot...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016